(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 


(19) Worid Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
21 February 2002 (21.02.2002) 



(10) International Publication Number 

PCT WO 02/14490 A2 


(51) International Patent Classification^: C12N 15/00 

(21) iDtemational Application Number: PCT/USO 1/25 166 

(22) International Filing Date: 10 August 2001 (10.08.2001) 

(25) Filing Language: English 

(26) Publication Language: English 


(30) Priority Data: 

60/224.948 


1 1 August 2000 ( 1 1 .08.2000) US 


(71) Applicant: GENENCOR INTERNATIONAL, INC. 
[US/US]; 925 Page Mill Road, Palo Alto, OA 94304 (US). 

(72) Inventorsj DIAZ-TORRES, Maria, R.; 142 Johnson Av- 
enue, Los Gatos, C A 94134 (US). SCHELLENBERGER, 
Volker; 914 Moreno Avenue, Palo Alto, OA 94303 (US). 
SELIFONOVA, Olga, V,; Apartment 318, 1405 Olive 
Lane North, Plymouth, MN 55447 (US). MORRISON, 
Thomas, B.; 3767 Redwood Circle, Palo Alto, CA 94306 
(US). LEE, Edwin, W.; 239 Sweeny Street, San Francisco, 
CA 941 34 (US). 


(74) Agent: BOYD, Victoria, L.; Genencor International, Inc., 
925 Page Mill Road, Palo Alto, CA 94304 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR. HU. ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD. MG, MK, MN, MW, 
MX, MZ, NO, NZ. PL, PT, RO, RU, SD, SE, SG, SI, SK, 
SL, TJ, TM, TR, TT, TZ, UA. UG, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, C_Y, DE, DK, ES, FI. FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF. 
CG, CI, CM, GA, GN, GQ, GW. ML, MR, NE, SN, TD, 
TG). 

Published: 

— without international search report and to be republished 
upon receipt of that report 

[Continued on next page] 


= (54) Title: BAQLLUS TRANSFORMATION, TRANSFORMANTS AND MUTANT LIBRARIES 

CLONING BY IN VITRO ASSEMBLY AND TRANSFORMATION OF COMPETENT BACILLUS 


< 
o 

ri 
o 


feftHB 


assembly (PCR 
and/or ligation) 


D is6q.2| f kan | | right HB "j 


r tefl MB I seq, 1 1 seq. 2 1 Ran | right HB "[ 
transformation | 



recipient 
chromosome 


feft HB 


right HB 


chromosomal 
integration 


resulting 
Chromosome 


/ leftHS lseq.l[seq.2| kan I rightHS" 


7 


I (57) Abstract: The present invention provides methods for building DNA constructs in vitro, transforming such constructs into 
. competent Bacillus strains with good efficiency, and generating populations of mutants. Also provided is a method to assemble 
' DNA constructs in situ. 
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BACILLUS TRANSFORMATION, 
TRANSFORMANTS AND MUTANT LIBR/5cRfES 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

The present invention relates to Bacillus transformation, transformants, and mutant 
libraries. 

2. Background 

A widely used known method for altering the chromosome of Bacillus involves 
building plasmid constojcts and transfomning them into E. coli. Subsequently, the plasmids 
are isolated from E, coli and transfomned into Bacillus. Widespread use of this method can 
be attributed, at least in part, to the notion that E. coli is easier to transform than Bacillus. In 
this regard, the in vitro ligation of plasmids results in nicked products that can transfonn E. 
coli but do not transform Bacillus, 

The conventional approach to constructing libraries in Bacillus is based on 
replicating plasmids. Such an approach, unfortunately, is generally associated with a 
number of disadvantages, including: 

1) One needs an antibiotic or other selectable marker to maintain the plasmid in the 
cells. This is not desirable for production strains and it constrains the choice of 
screening conditions. 

2) Genes on the plasmid are present in multiple copies. This affects gene regulation 
and expression. 

3) Variations in copy number can skew a library, i.e., one may preferentially identify 
clones with increased copy number instead of improved gene function. 
Alternatively, integrating plasmids or vectors may be used. Integrating vectors do 

not contain an origin of replication and therefore require insertion into the host chromosome 
to be stably maintained. However, these are not without problems. Integration occurs via a 
Campbell-type recombination event that results in a duplication of the cloned region at either 
end of the Inserted (now linear) vector. Depending on the position of the integration genes 
may be disrupted resulting in poor transfonnation efficiency. 

With either method there is still a need to generate sufficient amounts of the desired 
sequence to effect an efficient transfomiation. This was usually accomplished by inserting 
the desired sequence into a shuttie vector tfiat was inserted into E. coll, allowed to replicate 
to a high copy number, and recovering the amplified DNA. This process could mn into 
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problems due to sequence size; the larger the sequence the more difficult it could be to 
insert and replicate. Additionally, there were sequences that would not insert or replicate In 
E. coll resulting in a loss of diversity In the DNA library that was being built Finally, the high 
copy number of some plasmids/vectors is deleterious to E. coli. 

The prior art methods failed to reproducibly render cells hypercompetent nor did they 
generate large libraries easily in Bacillus and other host cells. In order to generate a small 
library the prior art utilized E. coli to amplify DNA of interest to obtain a sufficient quantity for 
transformation of host cells. The methods provided herein allows the generation of large 
libraries in a reproducible manner without the use of E. coli. 

Thus, there is a need for a Bacillus transfomriation method that is relatively 
straightfoHA^ard, efficient and reproducible. In particular, a method is needed that permits 
the efficient transformation of Bacillus, without requiring intervening steps involving the use 
of additional microorganisms, such as E. coli. Particularly advantageous would be a 
transfonnation method that is amenable to the construction of mutant libraries, and which 
avoids or overcomes one or more of the above-mentioned disadvantages. 

SUMMARY OF THE INVENTION 

The present invention provides methods for building polynucleotide constructs in 
vitro, directly transfomning such constructs into competent Bacillus species and/or strains 
with good efficiency, and generating populations of mutants {e.g., a mutant library). 

In one of its aspects, the present invention provides a method of producing a 
transformed microorganism. According to one embodiment, the method includes the steps 
of: 

(i) selecting a competent microorganism of the genus Bacillus; 

(ii) producing a polynucleotide constaict in vitro; and 

(iii) directly transforming the microorganism with the construct such that the 
construct becomes integrated into a chromosome of the microorganism. 

In one embodiment, the construct includes mutagenized DNA. 

In another embodiment, the construct includes a sequence of interest, flanked on 
each side by a homology box. Optionally, the constaict can additionally include non- 
homologous outer flanks. 

According to one embodiment, the construct is a non-plasmid DNA construct. 

In one embodiment, the competent microorganism of the genus Bacillus is an ultra- 
competent strain, preferably Pxyl-comK. 
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In accordance with one embodiment, the above method additionally includes the 
steps of (i) selecting a target region in a chromosome of the competent Bacillus, and (ii) 
increasing (e.g.. maximizing) the homology between the target region and the construct. 

Another aspect of the present invention provides a library of mutants produced by 
the above method. 

A further aspect of the present invention provides a method for the directed evolution 
of a sequence in the Bacillus chromosome. One embodiment of the method includes the 
steps of: 

(i) in vitro mutagenesis of a selected sequence, 

(ii) direct transfomiation of the mutagenized sequence into a competent Bacillus, 
e.g., a Bacillus carrying Pxyl-comK, 

(iii) screening for, or selection of, mutants possessing or exhibiting a desired 
property; and 

(iv) repeating steps (i)-(iii) for one or more rounds. 

Advantageously, the metiiods disclosed herein allow one to evolve single-copy 
genes of a competent Bacillus strain. 

In another of its aspects, the present invention provides a metiiod for constructing a 
polynucleotide sequence in a target locus of a selected recipient strain, wherein the strain 
includes a selectable marker residing at the target locus. One embodiment of the method 
includes Uie steps of: 

(i) assembling a construct comprising a sequence of interest, a selectable 
marker that differs from the residing marker of the recipient strain, and two flanks which are 
homologous to sequences of the target locus; 

(ii) transforming the recipient strain with the constnjct under conditions 
pemiitting Uie incoming sequence and selectable marker to replace tiie residing marker, and 
selecting for transfomiants that include the Incoming selectable marken 

(iii) repeating steps (i) - (iii), with the newly inserted selectable marker acting as 
the residing marker. 

Optionally, after step (ii) the following additional step can be performed: 

testing the ti^ansformants for loss of tiie residing marker, ttiereby verifying that tfie 

construct was incorporated into Uie con-ect locus of the chromosome. 

These and otiier features, aspects and advantages of tiie present invention will 

become apparent from tiie following detailed description, in conjunction witii the appended 

claims. 
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DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic diagram siiowing cloning by in vitro assembly and 
transfonnation of competent Bacillus, in accordance witii the present invention. 

Figure 2 illustrates, in schematic fashion, the addition of non-homologous flanl<s to 
the assembled sequences to increase transformation efficiency, in accoixiance with the 
present invention. 

Figure 3 is a schematic diagram illustrating PGR mutagenesis of a region of the 
Bacillus chromosome, in accordance with the present invention. 

Figure 4 is a schematic diagram illustrating that maximizing the homology between 
the transfomiing DNA and the target region of the chromosome can increase the 
transformation efficiency, as taught by the present invention. 

Figure 5 illustrates, in schematic fashion, using a competent host that carries an 

inactive version of the marl<er gene, used to select transformants, as taught by the present 
invention. 

Figure 6 shows representative structures of transforming DNA, according to the 
teachings herein. At top, homology boxes flank an incoming sequence. At center, other 
non-homologous sequences are added to the ends. At bottom, the ends are closed such 
that the transforming DNA fonns a closed circle or loop. 

Figure 7 illustrates, in schematic fashion, Bacillus strain construction by iterative 
mari<er replacement, in accordance with the teachings of the present invention. 

Figures 8A & B: Figure 8A is a schematic illustration of the DNA construct used in 
Example 5 wherein the homology box length was varied. Figure 8B is a graph illustrating 
that PGR fragments containing the gene of interest, a selectable marker and varying lengths 
of flanking chromosome can be used for transfonnation directly into Bacillus (crosses), 
cloned into a plasmid and used for transformation either as an uncut plasmid (closed circles) 
or a linear plasmid (open circles). 

Figure 9 is a schematic illustration of the mutagenized DNA fragment used in 
Example 2. It is 6.8kb long comprising a left homology box (approx. 2.2 kb). the gene of 
Interest and selectable marker (approx. 2.4 kb), and a right homology box (approx. 2.1 kb). 

Figure 10 is a schematic of a three piece PGR fusion construct. The figure also 
shows the location where the primers align with a sequence within the DNA construct. 

Figure 1 1 depicts an exemplary method of adding nonhomologous flanks to the DNA 
construct. The DNA construct is Inserted into a plasmid. amplified and cut with restriction 
enzymes to add non-homologous flanking regions. 

Figure 12 is a representation of a vector useful in the present invention. In this 
vector two Bbs I sites have been engineered into the vector. Bbs I is a type lis restriction 
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enzyme. Other type lis enzyme site may be engineered into the vector instead of the Bbs I 
site. Thus, the Bbs I site is illustrative and not limitative. The vector is cut with Bbs I and the 
DMA construct is inserted into the vector. 

Rgure 13 is a schematic of the process used to prepare the insert that was 
subsequently ligated Into the vector. 

Figure 14 is a photograph of a gel showing that the ligation reaction produced large 
molecular weight ligation products. The gel is a 1 .2% agarose gel. Lane 1 was loaded with 
2 ul of the ligation product. Lane 2 was loaded with 2 ul of the linearized vector (i.e. the 
vector digested with Bbs I). Lane 3 contained 250 ng of Roche ladder X standard molecular 
weight markers. 

Figure 15 depicts the modification of a gene of interest. In the figure the MetB gene 
is modified so that 621 bp are deleted. The full length metB is 672 bp and thus this is not a 
full gene deletion. The primer N1, N2, N3 and N4 are shown vvith their relative alignment 
positions. 


DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides methods for building ONA constructs in vitro, 
transforming such constaicts into competent Bacillus strains with good efficiency, and 
generating populations of mutants in Bacillus. 

Definitions: 

Unless defined othenwise herein, all technical and scientific terms used herein have 
the same meaning as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Singleton, et a!., Dictionary of Microbiology and Molecular 
BIOLOGY, 2d Ed.. John Wiley and Sons, New York (1994), and Hale & Marham. The Harper 
COLUNS DICTIONARY OF BIOLOGY, Harper Perennial, NY (1 991 ) provide one of skill with a 
general dictionary of many of the temis used in this invention. Although any methods and 
materials similar or equivalent to those described herein can be used in the practice or 
testing of the present invention, the prefenred methods and materials are described. 
Numeric ranges are inclusive of the numbers defining the range. Unless othenvise 
indicated, nucleic acids are written left to right in 5' to 3' orientation; amino add sequences 
are written left to right in amino to carboxy orientation, respectively. The headings provided 
herein are not limitations of the various aspects or embodiments of the invention which can 
be had by reference to the spedfication as a whole. Accordingly, the terms defined 
immediately below are more fully defined by reference to the specification as a whole. 
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Transforming DNA or DNA construct 

The transforming sequence or transfonning.DNA is generated in vitro by PGR or 
other suitable techniques. The typical structure of transforming DNA is shown in a 
schematic form in Figure 6. The transfonning DNA comprises an incoming sequence. It 
may further comprise an incoming sequence flanked by homology boxes. In a further 
embodiment, the transforming DNA may comprise other non-homologous sequences, added 
to the ends, i.e., stuffer sequences or flanks. The ends can be closed such that the 
transforming DNA forms a closed circle. 

Transforming DNA is DNA used to introduce sequences into a host cell or organism. 
The DNA may be generated in vitro by PGR or any other suitable techniques. In a prefen-ed 
embodiment, mutant DNA sequences are generated with site saturation mutagenesis in at 
least one codon. In another preferred embodiment, site saturation mutagenesis is performed 
for two or more codons. In a further embodiment, mutant DNA sequences have more than 
40%, more than 45%, more than 50%. more than 55%, more than 60%, more than 65%, 
more than 70%, more than 75%, more than 80%, more than 85%, more than 90%. more 
than 95%, or more than 98% homology with the wild-type sequence. Alternatively, mutant 
DNA may be generated in vivo using any known mutagenic procedure such as, for example, 
radiation, nitrosoguanidine and the like.. The desired DNA sequence is then isolated and 
used in the methods provided herein. 

The transforming sequences may be wild-type, mutant or modified. The sequences 
may be homologous or heterologous. Transforming sequence and DNA construct may be 
used interchangeably. 

Incoming sequence 

This sequence can code for one or more proteins of interest. It can have other 
biological function. In many cases the incoming sequence will include a selectable marker, 
such as a gene that confers resistance to an antibiotic. 

An incoming sequence as used herein means a DNA sequence that is newly 
introduced into the Bacillus chromosome or genome. The sequence may encode one or 
more proteins of interest. An incoming sequence comprises a sequence that may or may 
not already present in the genome of the cell to be transformed, i.e., either a homologous or 
heterologous sequence (defined herein). 

In one embodiment, the incoming sequence encodes a heterologous protein, said 
protein(s) including, but not limited to homiones, enzymes, growth factors. In another 
embodiment, the enzyme Includes, but is not limited to hydrolases, such as protease, 
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esterase, lipase, phenol oxidase, permease, amylase, pullulanase, cellulase, glucose 
isomerase, laccase and protein disulfide isomerase. 

In a second embodiment, the incoming sequence may encode a functional wild-type 
gene or operon, a functional mutant gene or operon, or a non-functional gene or operon. 
The non-functional sequence may be inserted into a target sequence to disrupt function 
thereby allowing a determination of function of the disrupted gene. 

Flanking Sequence 

A flanking sequence as used herein means any sequence that is either upstream or 
downstream of the sequence being discussed, e.g., for genes ABC, gene B is flanked by 
the A and C gene sequences. In a prefen-ed embodiment, the incoming sequence is flanked 
by a homology box on each side. In a more preferred embodiment, the incoming sequence 
and the homology boxes comprise a unit that is flanked by stuffer sequence (as defined 
herein) on each side. A flanking sequence may be present on only a single side (either 3' or 
5') but it is preferred that it be on each side of the sequence being flanked. 

Stuffer Sequence 

Stuffer sequence means any extra DNA that flanks the homology boxes, typically 
vector sequences, but could be any non-homologous DNA sequence. Not to be limited by 
any theory, a stuffer sequence provides a noncritical target for a cell to initiate DNA uptake. 

Wild-tvpe genes 

The terms "wild-type sequence," or "wild-type gene" are used interchangeably and 
refer to a sequence native or naturally occurring in a host cell. The wild-type sequence may 
encode either a homologous or heterologous protein. A homologous protein is one the host 
cell would produce without intervention. A heterologous protein Is one that the host cell 
would not produce but for the intervention. 

Mutant genes 

The terms "mutant sequence," or "mutant gene" are used interchangeably and refer 
to a sequence that has an alteration in at least one codon occurring in a host celFs wild-type 
sequence. The expression product of the mutant sequence is a protein with an altered 
amino acid sequence relative to the wild-type. The expression product may have an altered 
functional capacity, e.g., enhanced enzymatic activity and the like. 

Modified genes 

The terms "modified sequence" or "modified genes" are used interchangeably and 
means a deletion, insertion or intenuption of naturally occurring nucleic acid sequence. The 
expression product of the modified sequence may be a truncated protein if the modification 
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is a deletion or interruption of the sequence. The truncated protein may retain biological 
activity. The expression product of the modified sequence may be an elongated protein if 
the modification is an insertion into the nucleic acid sequence. An insertion may lead to a 
truncated protein as the expression product if the insertion results in the formation of a stop 
codon. Thus, an insertion may result in either a truncated protein or an elongated protein as 
an expression product. 

Host cell 

"Host cell" means a cell that has the capacity to act as a host and expression vehicle 
for an incoming sequence according to the invention. In one embodiment, the host cell is a 
microorganism. In a preferred embodiment according to the present invention, "host cell" 
means the cells of Bacillus. As used herein, the genus Bacillus includes all members known 
to those of skill in the art, including but not limited to B. subtilis, B, licheniformis, S. lentus, S. 
brevis, S. stearothemnophilus, B. alkalophilus, B. amyloliquefaciens, B. coagulans, S. 
ciculans, B. lautus and B, ttiuringiensis. Other cells useful in the present invention include 
Acinetobacter, Thermus, Deinococcus Radiodurans. 

Homologous sequence 

A homologous sequence is a sequence that is found in the same genetic source or 
species. For example, the host cell strain may be deficient in a specific gene. If that gene is 
found in other strains of the same species the gene would be considered a homologous 
sequence. 

Heterologous sequence 

A heterologous sequence is a sequence derived from a separate genetic source or 
species. A heterologous sequence Is a non-host sequence, a modified sequence, a 
sequence from a different host cell strain, or a homologous sequence from a different 
chromosomal location of the host cell. 

Homology box 

Homology boxes may flank each side of the incoming sequence. The sequence of 
each homology box is homologous to a sequence In the Bacillus chromosome. These 
sequences direct where in the Bacillus chromosome the new construct gets integrated and 
what part of the bacillus chromosome will be replaced by the incoming sequence. 

Chromosomal mteqration 

This is a process where the incoming sequence is introduced Into the Bacillus 
chromosome. The homology boxes of the transfonning DNA will align with homologous 
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regions of the chromosome. Subsequently, the sequence between the homology boxes is 
replaced by the incoming sequence in a double crossover (i.e., homologous recombination). 

Homologous Recombination 

Homologous recombination means the exchange of DNA fragments between two 
DNA molecules or paired chromosomes (during crossing over) at the site of identical 
nucleotide sequences. In a pretended embodiment, chromosomal integration is by 
homologous recombination. 

Target Sequence 

A target sequence as used herein means the DNA sequence in the host cell that 
encodes the sequence where it is desired for the incoming sequence to be inserted into the 
host cell genome. The target sequence may encode a functional wild-type gene or operon, 
a functional mutant gene or operon, or a non-functional gene or operon. 

Selectable Markers 

Selectable markers are usually genes that confer antibiotic resistance or a metabolic 
advantage on the host cell to allow cells containing the exogenous DNA to be distinguished 
from cells that have not received any exogenous sequence during the transformation. A 
residing selectable marker is one that is located on the chromosome of the microorganism 
to be transfomned. A residing selectable marker encodes a gene that is different from the 
selectable marker on the transforming DNA construct. 

Sequence of Interest 

As used herein, a sequence of interest may be an incoming sequence or a sequence 
to be generated in situ. The terms gene of interest and sequence of interest may be used 
interchangeably herein. 

Library of mutants 

A population of cells which are identical in most of their genome but include different 
homologues of one or more genes. Such libraries can be used, for example, to identify 
genes or operons with improved traits. 

Super competent or Hvpercompetent 

As used herein, hypercompetent means that greater than 1% of a cell population is 
tnansformable with chromosomal Bacillus DNA. Alternatively, hypercompetent means that 
greater than 10% of a cell population is transfomnable with a self-replicating Bacillus plasmid. 
Preferably, the super competent cells will be transformed at a rate greater than observed for 
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the wild-type or parental ceil population. Super competent and hypercompetent are used 
interchangeably herein. 

Embodiments: 

Although Bacillus is used throughout the specification it should be understood that 
any competent cell may be used in the inventive methods disclosed herein. 

Figure 6 depicts the DNA constructs that find use in the present invention. Briefly, in 
one embodiment, the DNA constmct comprises an incoming sequence flanked by homology 
boxes on each side, i.e., there is a left homology box and a right homology box, and may be 
referred to as a basic DNA constnjct. In a second embodiment the basic DNA construct 
further comprises flanking sequences, i.e., stuffer sequences, on each end and may be 
refen-ed to as a flanked DNA construct. In another embodiment, the flanked DNA construct 
is circularized and may be referred to as a circular DNA construct. The circular DNA 
construct may comprise plasmid DNA or it may comprise non-plasmid DNA in the portion 
represented by a thin line linking the ends of the flanking sequences, i.e., the flanking 
sequences' free ends should there be no circularization. in Figure 6. 

The incoming sequence may encode more than one protein. As shown in Figure 1 
the DNA construct comprises a left homology box, an incoming sequence comprising a first 
sequence (seq. 1) and a second sequence (seq. 2), a selectable mariner (here, for example 
purposes only, the antibiotic marker confem'ng kanamycin resistance, kan, is used), and a 
right homology box. It should be noted that the figure is not limifing on the inventive method 
and that more than two sequences may comprise the incoming sequence, i.e., there may be 
a third sequence (seq. 3), etc. 

The first and second sequences may encode different and distinct proteins, either full 
length or portions thereof. For example, the first sequence may encode a protease (or 
portion thereof) and the second sequence may encode a hormone (or portion thereof). 

Alternatively, the first and second sequences may encode different portions of the 
same protein. For example, the first sequence may encode the amino terminal and the 
second sequence may encode the carboxy terminal of a single protein. This would allow 
either or both of the sequences to be selectively mutagenized with different mutagenizing 
protocols being used. Or the carboxy and amino temiinal sequences of a protein may be 
joined while omitting an intervening sequence found in the native protein. 

As another option, Uie first and second sequences may encode variants of a single 
protein. Thus, for example, sequence 1 may encode Type A hemoglobin while sequence 2 
encodes Type S hemoglobin. 
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The various components of the DNA constaict may be assembled by PGR and/or 
ligation. It should be noted that any technique may be used as long as the DNA construct 
has the final configuration desired. 

Once the DNA construct is assembled in vitro it may be used to: 1) insert 
heterologous sequences into a desired target sequence of a host cell, or 2) mutagenize a 
region of the host cell chromosome, i.e., replace an endogenous sequence with a 
heterologous sequence, or 3) delete target genes. 

As noted in Figure 1, the recipient chromosome will possess sequences homologous 
and/or complementary to the homology boxes of the DNA construct. The homology boxes 
of the DNA construct will align with the homologous region of the recipient chromosome. 
The DNA construct will then insert into the recipient chromosome, preferably via 
homologous recombination. 

The DNA construct may further comprise flanking, non-homologous sequences, i.e., 
stuffer sequences, and is illustrated in Figure 2. The addition of non-homologous 
sequences, as shown below, increases the transformation efficiency. Not to be limited by 
theory, applicants propose the following mechanism. The mechanism of transformation of 
competent Bacillus is described in Dubnau, D. (1993) Bacillus subtilis and other gram- 
positive bacteria 555-584. Briefly, the transforming DNA binds to the cell and subsequently 
one strand is cleaved. The heterologous DNA is taken up by the cell starting from this 
cleavage site. If the initial cleavage occurs between the homologous flanks (shown in 
yellow) then chromosomal integration by double crossover becomes impossible. In an 
embodiment of the present invention, non-homologous flanks are added to the assembled 
sequences to increase transfonnation efficiency. Adding flanks to the transforming DNA, as 
taught herein, increases the probability that the DNA after being taken up will still retain both 
homologous regions that are required for chromosomal integration. This leads to an 
increase in transformation effidency. 

In one general embodiment, the present invention involves assembling a DNA 
construct in vitro, followed by direct cloning of such construct into a competent Bacillus, 
such that the construct becomes integrated into a chromosome of the Bacillus. For 
example, PGR fusion and/or ligation can be employed to assemble a DNA construct in vitro. 
In a preferred embodiment, the DNA construct is a non-plasmid DNA construct. In one 
embodiment, the DNA construct comprises a DNA into which a mutation has been 
introduced. Bacillus can then be transformed with the DNA construct. In this regard, highly 
competent mutants of Bacillus are preferably employed to facilitate the direct cloning of the 
constructs into the cells. For example. Bacillus canrying the comK gene under the control of 
a xyiose-inducible promoter (Pxyl-comK) can be reliably transformed with very high 
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efficiency, according to the teachings herein. Direct transformation means that an 
intermediate cell is not used to amplify, or othenA^ise process, the DNA construct prior to 
introduction into the host cell. Introduction of the DNA construct into the host cell includes 
those physical and chemical methods known in the art to introduce DNA into a host cell 
without insertion into a plasmid or vector. Such methods include but are not limited to 
calcium chloride precipitation, electroporation, naked DNA, liposomes and the like. The 
DNA constructs may be co-transformed with a plasmid without being inserted into the 
plasmid. A library of mutants can be generated. 

Figure 1 illustrates how DNA sequences can be assembled and moved into the 
Bacillus chromosome, according to the teachings herein. In a preferred embodiment, parts 
of the assembled sequence are random. As a result, a population of mutants can be 
obtained, where a single copy of the mutated sequence has been integrated into the 
Bacillus chromosome. 

As previously discussed, a widely used prior method for altering the chromosome of 
Bacillus involves building plasmid constmcts and transforming them into E. coll. 
Subsequently, the plasmids are isolated from E. coli and transformed into Bacillus. The 
present invention, in contrast, provides in vitro construction and direct transfonnation into 
Bacillus, without the use of any such intervening microorganisms. 

As also discussed above, the conventional approach to constructing libraries in 
Bacillus is based on replicating plasmids. Such an approach, unfortunately, is associated 
with a number of disadvantages, including: 

1 One needs antibiotic or other selectable marker to maintain the plasmid in the cells. 
This is not desirable for production strains and it constrains choice of screening 
conditions. 

2 Genes on the plasmid are present in multiple copies. This affects gene regulation 
and expression. The approach herein, on the other hand, allows one to evolve 
single copv genes of a strain. 

3 Variations in copy number can skew a library, i.e. one may preferentially identify 
clones with increased copy number instead of improved gene function. 

It will be appreciated that the present invention overcomes such problems associated with 
the use of replicating plasmids. 

Multimerize the assembled sequence 

According to one embodiment of the present invention, the transforming DNA can be 
multimerized. for example, by ligation. This has a similar effect as adding non-homologous 
flanks, i.e., stuffer sequences. It increases the probability the DNA after uptake into the cell 
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will still have both homology boxes flanking the incoming sequence, thereby increasing 
transformation efficiency. 

Mutaqeniztng a region of the Bacillus chromosome 

The present invention provides a process for mutagenizing a region of the Bacillus 
chromosome, an embodiment of which is illustrated in Figure 3 (note, the hatched region 
has been mutagenized). One can amplify a region of the Bacillus chromosome under 
mutagenic conditions and transform the resulting DNA back into Bacillus. If the PGR 
reaction is performed under conditions which favor the introduction of mutations, then one 
obtains a mutant library. Further, the mutagenic PGR product may be assembled with 
homology box and insertion sequences to generate transforming DNA in which only the 
targeted area is mutagenized. To enrich transformants one can introduce a selectable 
marker dose to the target sequence prior to the mutagenesis. Alternatively, if the 
mutagenized region of the chromosome does not carry a selectable marker, a congression 
will enrich for cells also taking up transfomiing DNA . For example, a plasmid bearing a 
selectable marker is co-transfonned with the transforming DNA. The population of cells 
selected for the plasmid marker will be enriched for the presence of insertion sequences. 
Later, the plasmid may be removed from the cell, while maintaining the insertion sequence 
within the chromosome. Lastly, in the absence of selectable marker, the high transformation 
rate permits direct screening of cells for desired transformants. 

In another embodiment, the assembly of long DNA sequences is accomplished in 
situ. Individual DNA constructs are utilized to introduce segments of the final heterologous 
DNA sequence into a target sequence or locus of the host cell. 

Construction of long sequences by iterative marker replacement 

This method, as taught herein and illustrated in Figure 7, provides that one go 
through several steps of in vitro assembly and transfomnation. As a result one can introduce 
many sequences into a particular locus of the Bacillus chromosome. Each round replaces 
the antibiotic marker that was introduced by the previous round. As a result one can repeat 
the process many times and still wori^ with only two antibiotic mariners. 

According to one embodiment, the process comprises the steps of: 

(I) by PGR fusion or other suitable technique one assembles a sequence 
comprising a sequence of interest, a selection mariner and two flanks, which are 
homologous to the target locus; 

(ii) the recipient strain is transfomied with the constructed sequence and one 
selects for resistance to the incoming selection marker, 
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(iii) the transformants are then tested for loss of the residing marker which 
ensures that the construct was incorporated into the correct locus of the chromosome; 

(iv) subsequently, the above cycle can be repeated by reversing the role of the 
incoming and residing markers. 

In another embodiment, the microorganism doesn't possess an endogenous 
selection marker in the first round of transformation and cannot be tested for the loss of a 
residing marker. Thus, after being transfonned the microorganism is screened for the 
incoming selection marker. 

This method allows one to assemble large sequences (e.g., »5kb) in vivo from 
smaller pieces, which can be generated in vitro by PGR fusion or other suitable techniques. 
Only two antibiotic markers are required because each step displaces the marker gene used 
in the previous round. 

The entire resulting construct can be moved between different strains using 
chromosomal transfomnation or transduction. Thus, by way of this method, one can 
accumulate various sequences during the course of a project and retain the ability to 
simultaneously move them into a new strain. 

During the final cloning cycle, one can use a selected gene that is essential for 
growth under some conditions (e.g., synthesis of an amino add, utilization of a certain 
sugar) instead of the incoming mariner. The resulting strain would then be free of any 
antibiotic genes. 

It should be appreciated that the iterative aspect of this method generates value as it 
permits the assembly of large sequences. This method allows one to introduce multiple 
sequences from various sources into a strain (e.g.. bacteria, fungi, eukaryoHc, etc.). This 
method permits one to generate tandem gene repeats as a method for increasing gene copy 
number. This method pemnits one to generate strains containing multiple mutations and 
inserted sequences but no antibiotic markers. 

The methods disclosed herein directed to the assembly of transforming DNA 
constructs may be used to direct the evolution of a sequence or target locus within the host 
cell. Selection of the target sequence allows the design and/or in vitro mutagenesis of the 
target sequence. The mutagenesis of a locus of the host cell, i.e., recipient, chromosome is 
depicted in schematic foma in Figure 3. It should be appreciated that although PGR 
mutagenesis is depicted any in vitiro method of mutagenesis may be used. Thus, the 
depiction of PGR mutagenesis is illusti^tive and not limitative. 

According to one preferred embodiment, the method comprises the following steps: 

1 ) assembling a tiransfonning DNA constiruct; 

2) in vitro mutagenesis of the DNA construct; 
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3) transforming a competent host cell with the mutagenized sequence; 

4) screening for or selecting mutants having a desired property or characteristic; and 

5) repeating steps 1 -4 for one or more rounds. 

In a preferred embodiment the host cell is a Bacillus. In a more prefen-ed embodiment the 
Bacillus is a supercompetent strain. The supercompetent strain is preferably a Bacillus 
canying the Pxyl-comK constaict. 

Identification of Transformants 

Although the presence/absence of marker gene expression suggests that the gene 
of interest is also present, its presence and expression should be confirmed. For example, if 
the nucleic acid encoding a secretion factor is inserted within a marker gene sequence, 
recombinant cells containing the insert can be identified by the absence of mari<er gene 
function. Alternatively, a martcer gene can be placed in tandem with nucleic acid encoding 
the secretion factor under the control of a single promoter. Expression of the marker gene 
in response to induction or selection usually indicates expression of the gene of interest as 
well. 

Alternatively, host cells which contain the coding sequence for a sequence o interest 
and express the protein may be identified by a variety of procedures known to those of skill 
in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA 
hybridization and protein bioassay or immunoassay techniques which include membrane- 
based, solution-based, or chip-based technologies for the detection and/or quantification of 
the nucleic acid or protein. 

Other Embodiments 

B. subtilis is a bacteria which is capable of entering sporulation during times of great 
stress in the environment, such as extreme lack of nutrients. Making this decision triggers a 
very elaborate and expensive conversion to the sporulation development state. Over 50 
genes which need to be expressed for sporulation are under the control of eight sporulation 
control genes. These are SpoOA. OB. OE. OF. OH. OJ, OK, and OL. with spoOA being the 
most critical control factor. Mutation in the spomlation control genes allows the cells to 
ignore their environment so that they fail to enter spomlation and continue production of 
heterologous or homologous proteins. A mutation in the oppA gene of the oppA operon has 
been shown to enhance protein production. See WO 00/39323. 

The degU gene of Bacillus subtilis encodes a protein involved in the control of 
expression of different cellular functions, including degradative enzyme synthesis, 
competence for DNA uptake and the presence of flagella. Two classes of mutations have 
been identified in both genes. One class of mutations leads to decreased expression {degU 
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mutations) while the second one leads to enhanced expression [degU{Hy/) mutations] of 
regulated genes, i.e., genes regulated by the degU system. This second class of mutations 
is associated with a pleiotropic phenotype which includes the ability to sporulate in the 
presence of glucose, loss of flagella and decreased genetic competence. 

Many industrially important products, e.g., enzymes, hormones, growth factors, and 
other proteins, are produced from members of the genus Bacilli in large scale fermentation 
processes. Some of these include proteases, lipases, amylases, and beta-glucanases. 
The protein of interest to be expressed may be either homologous or heterologous to the 
host. In the first case overexpression should be read as expression above normal levels in 
said host. In the latter case basically any expression is of course overexpression. Thus, it is 
advantageous to have a cell that will fail to sporulate yet possesses enhanced expression of 
genes of interest. 

An oppA (i.e., spoOK) mutation in combination with a degU(Hy) mutation would 
appear to be ideal for production of a gene of interest. However, it has been shown that 
mutation of the oppA gene results in a decreased competency. See Rudner et al., J. 
Bacteriology (1991) 173:1388-1398. As noted above the degU{Hy) mutant also results in 
decreased competency. Thus, introduction of a gene of interest or other genetic 
manipulation into such a host cell would be significantly more difficult than in the absence of 
such mutation. 

It has been advantageously found that the inventive methods described herein 
overcome this difficulty. Use of a pyxl-comK Bacillus strain overcomes the decreased 
competency exhibited by degU{H\/) oppA' strains. It has been found that the introduction of 
pyxl-comK into Bacillus not only restores competency but the cells are hypercompetent 
relative to wild-type (or parental) cells. Thus, heterologous or homologous sequences may 
be introduced into previously low competency cells. 

Transforming Bacillus with PCR-generated DNA and getting many transformants 
(>100). The methods provided by the present invention allows for the generation of large 
libraries. 

The methods disclosed herein may be used with mutations that enhance 
competence. Employing other mutations to enhance competence, e.g., comS instead of 
comK, mutations to comS homologs and the like are contemplated by the present invention. 

The methods described herein may be used in any microorganism that can be made 
competent. Direct transfomiation in other organisms which can be made competent (like 
Acinetobacter, Themnus, Deinococcus Radiodurans) is contemplated. 
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The methods herein should work for any recombination goal, such as insertions, 
deletions or replacements. Plasmids with temperature sensitive replication would facilitate 
the curing step. Ligating the PGR products to form concatamers are contemplated for 
improving the transformation frequency and allowing smaller homology boxes to be used. 

Have inactive homoloque 

reside in the host to improve transformation efficiencv 

A mutagenesis experiment, in accordance with an embodiment of the present 
invention, is Illustrated in Figure 3. In the illustrated embodiment, the incoming 
(mutagenized) DNA comprises a sequence which shares no homology with the target area 
of the Bacillus chromosome, in such case, a successful chromosomal integration requires 
that both homologous flanks of the incoming DNA align with their respective homologous 
regions of the Bacillus chromosome. The DNA between the two homologous regions is 
required to "bulge" if the incoming DNA differs in its length from the target region of the 
chromosome. As a result, the transfomiation efficiency is diminished. If the target region of 
the Bacillus chromosome is made highly homologous to the entire incoming DNA, then the 
alignment of both sequences becomes more efficient and the overall transformation 
efficiency can be increased (see Figure 4). One preferred way to implement this concept is 
to construct a recipient strain which contains a non-functional mutant of a selectable marker 
(see Figure 5). 

EXAMPLES 

The following examples are illustrative and are not intended to limit the invention. 
Example 1: 

Construction of an integrative plasmid containing 

a xvIR- PxvIA- comfC cassette (plasmid plWComKI) and transformation into Bacillus. 

A fragment containing the xylR repressor gene and the xylA promoter was obtained 
by PGR using primers xylR.2.f (this primer will incorporate a Hindlll site) and xylA,1.r and 
chromosomal DNA from BG168. A second fragment containing the comK gene including 
the first aa codon was obtained by PGR using primers comK.2.f and comK.2.r (this primer 
will incorporate a Xbal site) and same chromosomal DNA. After purification, the fragments 
were fused together by mixing them in a PGR reaction containing the extemal primers 
(xylR.2.f and comk.2.r). A PGR fragment of the expected size was purified, digested with 
Hindlll/Xbal and ligated into the integration vector pJM103 (Kapp, Edwards et al., 1990) 
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(containing carbenicillin and chloramphenicol resistance genes as markers) digested with 
the same restriction enzymes. Ligation products were transfomied into MN296 E. coli cells, 
colonies were selected on 50ug carbenicillin, plasmid DNA was isolated and screened for 
the 2.1 kbp xylR-PxylA-comK insert by DNA digest. The plasmid was integrated into B. 
subtilis. The resulting strain was grown overnight in L-broth medium, diluted to 1 ODeoo in L- 
broth containing 1% xylose and grown 2 hours with shaking to induce comK expression. 
The resulting process produced a population of cells in which greater than 1% of cells are 
transformed by bacillus chromosomal DNA containing a marker, indicating that these cells 
were super competent. Cells were considered supercompetent if greater than 10% of the 
cells were transformable with a Bacillus self-replicating plasmid. These cells were utilized 
in the following examples. 

The primer sequences used were as follows; 
xylR.2.f (SEQIDN0:1) 

GCGCGCAAGCTTTGCTTCAGAAATACTCCTAGAATAAAAAAACTC 
xylA.1.r (SEQ ID NO : 2) 

GGTGCGTCTGTTTTCTGACTCATGTGATTTCCCCCTTAAAAATAAATTCA 
comK.2.f (SEQ ID NO: 3) 

TGAATTTAl I I I I AAGGGGGAAATCACATGAGTCAGAAAACAGACGCACC 
comk.Zr (SEQ ID NO: 4) 

GCGCGCTCTAGAGGTATATGGCATCACCGGAGGAATTCCG 
Example 2: 

Mutagenesis of the subtiiisin gene using Z-Taa polymerase 

This Example describes an exemplary method to randomly mutagenize a large DNA 
fragment, containing a gene of interest (e.g. subtiiisin gene) with an antibiotic marker and 
approximately 2kb of homologous DNA on either side of the subtiiisin gene. In this specific 
example, the mutagenized DNA fragment is 6.8kb long comprising a left homology box 
(approx. 2.2 kb), the gene of interest and selectable marker (approx. 2.4 kb). and a right 
homology box (approx. 2.1kb). See Figure 9. 

Chromosomal DNA of Bacillus was extracted from an ovemfght culture of cells grown 
on semi solid nutrient agar plates (LA) + chloramphenicol plates. Usually three colonies 
from the ovemight plate were resuspended into 0.1 ml of SMM medium (0.5M sucrose, 
0.02M sodium maleate. 0.02M magnesium chloride-6H20, pH 6.5) containing lysozyme 
(100.000 U). The cell suspension was incubated for 30 minutes at 37*C with shaking. An 


wo 02/14490 


PCT/USOl/25166 


-19- 

additional 1 ml of SMM was added to the cells and the suspension microfuged for 1 .5 
minutes. The supernatant was removed and the step repeated. Finally the cell pellet was 
resuspended in 10 mM Tris (pH 8.0) and 0.5 mM EDTA, vigorously vortexed for 30 seconds 
and the sample was frozen at -20'*C. 

For PGR mutagenesis, a 100ul PGR reaction was set up using the Z-Taq 
polymerase kit (TaKaRa Shuzo Go., Ltd.). A typical reaction mixture contained 0.25uM of 
both primers, 125uM of Z-Taq dNTP mixture, 5-1 Ong of the chromosomal DNA, 2.5U of Z- 
Taq polymerase, IX Z-Taq polymerase buffer. The PGR amplification parameters were: 
98°C for 10sec (first cycle only) followed by 98"G for 5sec, 58°G for 10sec 72*'C for 2.5 
minutes. The PGR reaction was run for a total of 30 cycles. The primer sequences to 
amplify the 6.8Kb fragment were as follows: 

Primer 1 ATATGTGGTGGGGAAAGGCTGTGGGGTAAC (SEQ ID NO: 5) 
Primer 6 CTTTTCTTCATGCGCGGTCAGG I I I I IGTG (SEQ ID NO: 1 0) 

After the amplification process, the PGR products were analyzed on an agarose gel. 
For a typical PGR reaction, the limited amount of dNTP used yielded approximately 1 5ug of 
DNA. The mutagenized DNA was then transformed into Pxyl-comK Bacillus strains to 
generate a library. 

Example 3: 

Random mutagenesis of the signal sequence and propeptide of subtilisin 

This Example provides an exemplary method for randomly mutagenizing the signal 
sequence and propeptide of subtilisin. 

Primers used in the random mutagenesis reactions were as follows : 

1 ATATGTGGTGGGGAAAGGGTGTGGGGTAAG (SEQ ID NO: 5) 

2 GAGTTAGTTAAAAGAGTATTGTGTGATGGAGGTGGAATG (SEQ ID NO: 6) 

3 GATTGCAGGTGGATGAGAGAATAGTGTTTTAAGTAAGTG (SEQ ID NO: 7) 

4 GTAATTGGGGATGGGAGTGATTGGGG (SEQ ID NO: 8) 

5 GCGCAATCAGTGGGATGGGGAATTAG (SEQ ID NO: 9) 

6 GTTTTGTTGATGGGGGGTGAGGTTTTTCTC (SEQ ID NO: 10) 

To randomly mutagenize the signal sequence and propeptide of subtilisin gene, a 
PGR reaction using Primers 1 and 2 generated the 2.2 Kb left flanking region. Primers 3 and 
4 were used to mutate a 646bp region comprising of the signal sequence and propeptide 
region. Primers 5 and 6 were used to generate the 3.9kb right flanking region. Primers 2 & 3 
are complementary to one another, as are primers 4 & 5. See Figure 10. A typical 
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amplification reaction (100ul) was set up using either 0.5uM of Primers 1 and 2 (for the 
2.2Kb fragment) or 0.5uM of Primers 5 and 6 (for the 3.9Kb fragment) and 200uM of dNTP, 
2ul of log phase liquid culture grown to ODeoo^O.S (source of Bacillus chromosomal DNA), 
4U rTth XL polymerase. 1.25U Pfu Turbo DNA polymerase, 1X rTth XL polymerase buffer 
and 1.1mM Mg (OAc)2. 

The amplification parameters for the 2.2Kb and 3.9Kb fragments were: 95X for 
3min, 95**C for 30sec, 54°C for 30sec, and GSX for 2min for a total of 30 cycles. 

The PGR reaction products were analyzed on an agarose gel. If the con-ect size 
fragment was seen then the PGR product was purified using the QIAquick PGR Purification 
Kit. 

The 646bp fragment for mutagenizing the maturation site was amplified using 
Primers 3 and 4 (0.5uM each), 33ul 3x dNTP, 2ul of liquid culture grown to OD6oo=0.5 
(source of Bacillus chromosomal DNA), 0-0.3mM MnCI2 (varies upon the rate of 
mutagenesis desired), 5.5mM MgGb, 5U Taq polymerase, 1X Taq polymerase buffer in a 
100ul reaction. The PGR amplification parameters were as follows: 95**G for 30sec, 54°C for 
30sec, and 68°C for 30sec for a total of 30 cycles. The PGR reaction products were 
analyzed on an agarose gel. If the connect size fragment was seen, the PGR product was 
purified using the QIAquick PGR Purification Kit. 

The assembly of the enfire 6.8kb fragment containing the mutagenized maturation 
site was done using 3-5ul each of 646bp, 2.2kb, and 3.9kb fragments, 0.5uM each of 
Primers 1 and 6, 300uM of dNTP, 4U of rTth XL polymerase. 1.25U Pfu of Turbo DNA 
polymerase, 1X rTth XL polymerase buffer, and 1.1 mM Mg (0Ac)2in a 100ul reaction. The 
parameters for the assembly reaction were as follows: 95**C for 30sec, 48-50°G for 30sec. 
and es^'G for 7min for a total of 30 cycles. The PGR reacfion products were analyzed on an 
agarose gel. If the correct size fragment was seen, the PGR product was transfonned into 
Pxyl-comK Bacillus strains to generate a library. A total of 9,000 transfonnants were 
obtained. 

Example 4: 

Increasing the efficiency of transformation 

by adding non-homologous flanks to the transforming DNA 

This Example provides an exemplary method to increase the transformation 
efficiency of Bacillus for obtaining larger libraries. Although this example utilizes a plasmid 
that is amplified in E. coli, one skilled in the art will recognize any method that results in the 
addition of non-homologous flanks may be used with the present invention. The use of E. 
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coli in the present example was a rapid and simple means for adding non-homologous 
flanks and should not be construed as limiting. 

Figure 9 shows a schematic of the DNA constmct used for the present example. 
Primers 1 and 6 were used to generate the 6.8Kb DNA fragment. A typical PGR reaction 
(100ul) contained 0.25uM each of Primers 1 and 6, 300uM of dNTP, 5-1 Ong chromosomal 
DNA, 2.5U of Pfu Turbo DNA polymerase (Stratagene), and 1.5X of Pfu Turbo DNA 
polymerase buffer. The PGR amplification parameters were as follows: 95**C for 30sec, 
54'*C for 30sec, and 68*G for 7min for a total of 30 cycles. The PGR reaction products were 
analyzed on an agarose gel. If the correct size fragments were seen, the 6.8Kb DNA 
fragment was cloned into the TOPO vector following the manufacturers protocol (Invitrogen), 
The vector was then transformed into TOP 10 E. coli competent cells. 

A 10.3Kb fragment was generated as shown in Figure 1 1 . Plasmid DNA was 
prepared from the transformed E. coli cells using the QIAprep Spin Miniprep to obtain lots of 
DNA. The plasmid DNA was digested with Xma 1 restriction endonuclease (no Xma I site is 
present in the 6.8kb DNA fragment) to linearize the vector. 

The non-homologous flanks were derived from the TOPO cloning vector and were of 
E. coli based plasmid origin; therefore, the sequences were not expected to have any 
significant homology to regions in the Bacillus chromosome. 

Transformation efficiency of Pxyl-comK Bacillus competent cells for the two 
constructs: without (6.8Kb fragment) and with (10.3Kb fragment) the non-homologous 
flanking sequences was compared. 

Transformation with 2.2x10'^"^ moles DNA (approximately 1ug/ml) of the 6.8kb DNA 
fragment (i.e. without the non-homologous flanks) yielded approximately 3.2x10"^ cfu/ml 
(0.01% transformation efficiency). Transfomnation with 2.2x10'^"* moles DNA of the 10.3Kb 
linearized fragment (i.e. with the non-homologous flanks) yielded approximately 7.2x10^ 
cfu/ml (0.25% transformation efficiency). 

As an alternative to using the TOPO cloning kit from Invitrogen, one could also ligate 
the 6.8kb PGR product to itself. The multimerized DNA can then be transfonned into Pxyl- 
comK Bacillus strains to generate a library. 

Example 5: 

Optimizing Double Cross Over 

Integrations bv varying the size of the homology box 

This Example provides an exemplary method to evaluate transfomnation efficiency of 
Bacillus as a function of varying the size of the homology box and stuffer sequence. 
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Using primers of varying lengths that contained flanks con-esponding to 100, 200, 
400, 800. and 1600 bp homology boxes, a series of PGR fragments were generated 
containing genes coding for a protease, a selectable marker (CAT) and increasing amounts 
of flanking chromosome sequence. The DNA construct is shown in schematic form in 
Figure 8A. 


The various primers used for the amplification reaction were as follows: 


HB size 

Forward Primer 

Reverse Primer 

100 

CCTTGCAAATCGGATGCCTG 

CGCTGIIAI IGCI 1 1 IGI 1 1 TCI 


(SEQ ID NO: 11) 

GT (SEQ ID NO: 12) 

200 

GTTGGATAGAGCTGGGTAAAG 

CGCCGGAI 1 1 lATGTCATTGATA 


CO (SEQ ID NO: 13) 

A (SEQ ID NO: 14) 

400 

AGCCGI 1 1 IGCTCATACAAGC 

TGAAGTGAACATGTCAGAAA 


TT (SEQ ID NO: 15) 

(SEQ ID NO: 16) 

800 

ATAGCTTGTCGCGATCACCT 

1 1 II 1 GCAGACCGTTGGTTT 


(SEQ ID NO: 17) 

(SEQ ID NO: 18) 

1600 

CGCGACACAGCAGTTCAGCA 

TATCAI 1 1 1 IGCTTAATTTG 


(SEQ ID NO: 19) 

(SEQ ID NO: 20) 


A typical PGR reaction (100ul) contained 0.25uM of Forward and Reverse Primers 
each. 300uM of dNTP. 5-1 Ong 6.8Kb DNA fragment generated in Example 4, 2.5U Pfu 
TurboDNA polymerase, and 1.5X Pfu Turbo DNA polymerase buffer. The cycling conditions 
for.producing DNA fragments with different sized homology boxes were as follows: 95**G. 30 
sec; 52**G, 30 sec, and 68**G for 3 to 6 minutes for a total of 30 cycles (extension times 
depended on the expected product length, the rule being 1000 bp/min). 

An aliquot of this reaction was saved for the direct transformation into Bacillus, while 
the rest was cloned into the Zero Blunt TOPO vector following manufacturer directions 
(Invitrogen). The cloned fragments were transformed into competent £ coli cells and 
plasmid DNA prepared. 

Figure 8B shows the transformation efficiency for various sized homology boxes in 
either uncut plasmid, linear plasmid or PGR product (no plasmid). Transfomnation efficiency 
increases as the homology box size increases for each DNA construct tested. 0.2 ug of 
uncut plasmid (closed circle), linear plasmid {sfi\ at 50C for 5 hrs, open circle), or the PGR 
products (direct transformation, cross) were transfomied into 0.2 ml competent OS22.9 
Bacillus cells and colonies on solid L-agar plates with 10 ug/ml chloramphenicol were 
counted in order to estimate transfomnation efficiency. To confirm that the majority of 
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transformants were double cross over integrations, chromosome DNA from twenty randomly 
selected clones was amplified using primers flanking the homology box. these products 
indicated the selected clones had inserts generated by double crossover. 

The effect of homology box size on transformation efficiency was measured (Figure 
8B). Transformation efficiency was proportional to homology box size. 

Experimental Discussion 

Part of the improvement was due to having a larger length of DNA because the 
efficiency of transformation jumped over 10-foId when the PGR fragment was cloned into a 
vector. By cloning into a vector, the integrating DNA is flanked by stuffer sequence; 
presumably this stuffer sequence reduces the chance that the Bacillus DNA transporter will 
initiate in sequences between the homology boxes. 

Example 6: 

Site Directed Mutagenesis using QuikChanqe 

This example describes an exemplary method to perform site directed mutagenesis 
on the gene of interest and directly transform Bacillus strains with the mutagenized DNA. 

Site-saturation libraries were created by PGR at 3 different sites in the gene of 
interest (in this case protease) by using QuikChange (Stratagene) 

The primers used were as follows: 

Primer A: GAAGAGGATGGAGAANNSAGGAGAATGGCGCAATC (SEQ ID NO: 21 ) 
Primer B: GATTGCGCCATTGTGGTSNNTTCTGCATGGTCTTC (SEQ ID NO: 22) 
Primer C: GAGGATGGAGAAGTANNSACAATGGGGCAATCAG (SEQ ID NO: 23) 
Primer D: GTGATTGCGCCATTGTSNNTACTTGTGCATGGTC (SEQ ID NO: 24) 
Primer E: GATGCAGAAGTAACGNNSATGGCGCAATCAGTG (SEQ ID NO: 25) 
Primer F: GAGTGATTGCGCCATSNNCGTTACTTGTGCATC (SEQ ID NO: 26) 

Three separate PGR reactions were set up using primer pairs A&B, C&D and E&F. A 
typical PGR reaction (100ul) contained 1X Pfu Buffer, 1.5ul 10mM dNTPs, 1ul of 25uM 
primer, 1ul Pfu Turbo DNA polymerase, 200ng of plasmid DNA. The cycling conditions were: 
QS'^G for 35 seconds for one cycle; (95°G for 35 seconds, 50X for 1 minute, 68°G for 16.5 
minutes) for 16 cycles, and es^'G for 7 minutes 

The expected 7.8Kb band was identified on the agarose gel (-lOOng/ul). The PGR 
products were digested with 1 uL Dpnl at 37C for an hour to eliminate the pME03 template. 
The digestion reaction was spiked with another 1 ul of Dpnl and digested for another hour. A 
mock PGR reaction that did not undergo the PGR amplification was also digested to see 
how well Dpnl worths to get rid of the template DNA (template control). 
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A supercompetent Bacillus strain was directly transformed with the digested 
products. About 200ng of the library was incubated with lOOul of OD 600=0.5. The reaction 
was incubated at 37*C for 1.5 hours with shaking. Two transformations were set up for each 
of the conditions, which included the three-mutagenesis reactions (with A&B, C&D, and 
E&F), a template control, a parent vector control and no DNA condition. Serial dilutions of 
the cell suspension were streaked on selection plates and following 0/N incubation; the 
transformation efficiency was computed from the number of colonies obtained. 

Transformation efficiency as follows: 


DNA source 

Colonies/ug 

• A&B 

280 

C&D 

305 

E&F 

405 

Template control 

0 

Parent vector 

2.50E+05 

No DNA 

0 


Example 7: 

Direct transformation of ligated product. 

This example provides an exemplary method of mutagenizing the gene of interest 
with en-or prone PGR (fonms separate PGR products which can be annealed together) and 
directly transforming the ligated product into Bacillus strain. 

Generation of the vector 

The source of the vector DNA was the 800bp homology box plasmid described in 
Example 5. Bbs I sites were incorporated into this vector and 20ug of the plasmid was 
digested overnight at 37*C in New England Biolabs Buffer 2 with Bbs I to generate the 
vector with flanking sites. See Figure 12. 

Preparation of insert 

Insert DNA was generated from annealing two overlapping error prone PGR 
products. See Figure 13. The primer sets used for the PGR were: 

PI GTCTGAAI II I t I lAAAAGGAGAGGGTAAAG (SEQ ID NO: 27) 
P2 AATTGGCCAT GGTACGG ATTGGG (SEQ ID NO: 28) 
P3 TGTAGTGTGAAI I I 1 1 I lAAAAGGAGAGGGTAAAG (SEQ ID NO: 29) 
P4 CGGGATGGTACCGATTGCG (SEQ ID NO: 30) 

En-or prone PGR products were fomied by both sets of primers (P1&P2 [solid line 
product] and P3&P4 [hatched line product]) using conditions described in Example 3, with 
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cycling cx)nditions 94**C for 1min, 50°C for 1min, 68X for 2min, for 30 cycles. Negative 
control was a reaction without MnCl2. PGR products {330bp) were purified using Qiaquick 
PGR columns and pure DNA was pooled together. 

For annealing of the two products, 1.3ug of each was combined and heated at 95**G 
for 5 min then allowed to cool to room temp in heat block. Only one out of four annealed 
products were expected to ligate properly with vector con-ectly. 

Ligation 

A 1 :5 Molar ratio (vectoninsert) was used for ligation (DNA ligation kit from PanVera 
TAK6021 ) with a total of 440ng of DNA in the reaction mixture {10ul of vector+ insert DNA + 
1 0ul of Takara Biomedicals Ligase solution). This 1 :5 ratio was to the insert of interest (1 out 
of 4 of the reannealed products) so overall it was actually a 1:20 ratio (vectonannealed PGR 
product). Appropriate DNA controls were also made. The ligation reaction was incubated for 
1hr at 16**C. Incubating the reaction mixtures at 65°G for 15 minutes inactivated the ligase. 
The incompletely digested template was destroyed by incubating the ligation mixture with 
1000U of Bbs I in NEB2 buffer at 37°C for 2h. This mixture was then used for Bacillus 
transformation. 

Transformation 

5ul (55ng) of the 440ng ligation was transformed into 200ul of Bacillus competent 
cells. The cell suspension was shaken vigorously 1hr at 37^G. One hundred ul of serial 
dilutions of the cell suspension was plated on selection plates. 129,000 CFU/ug ligation 
mixtures were obtained, useful for combinatorial library construction. Ligation conditions 
produced large tandem repeats, which facilitated Bacillus transfomiation. See Figure 14 
(photo). Lane 1 depicts large, low mobility ligation products, Lane 2 depicts mobility of 
unligated vector. Lane 3 depicts molecular weight standards. 


Lane 

DNA 

CFU/ug 

1 

Ligated DNA 

1.3e5 

2 

Linear vector 

0 


DNA 


3 

1 KB ladder 



Example 8: 

Markerless deletion by insertion 

This example demonstrates the deletion of the metB gene of Bacillus. A PGR 
product was generated from sequences that flank the met B gene. This product and a 
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replicating Bacillus plasmid were ccHtransformed into the competent Bacillus, and cells 
resistant to the antibiotic marker on the plasmid were selected. These cells were screened 
for the metB deletion by methionine auxotrophy and absence of metB sequence from a PGR 
product. 

Preparation of insert 

PGR with 100f/r (Primers N1 and N2 in Figure 15) produced a 3958bp and 101f/r 
(Primers N3 and N4 in Figure 15) produced a 3451 bp. When fused together, a 7409 bp 
fragment is generated that is deleted for nucleotides 1-621 of metB (full length metB is 672 
bp; thus, this is not a full deletion). See Figure 15. 


Primers 

Primer sequence 

N1 

AAATGAAGCGCTCCTTCTTTCTTCG (SEQIDNO: 31) 

N2 

GCTTCCI 1 H3AIGCGGTAAGAATGTTTACGTGCCACCTCCATTATTTCCCCG 
(SEQIDNO: 32) 

N3 

CGGGGAAATAATGGAGGTGGGACGTAAACATTCTTACCGCATCAAAGGAAGC 
(SEQIDNO: 33) 

N4 

GAGCTTGCTCAAGAGCCTGATGACA (SEQ ID NO : 34) 


The amplification used 0.5uM of primer pairs N1/N2 or N3/N4, 300uM of dNTP, 200 
ng Badllus chromosome DNA, 5U Herculase (Stratagene) and 1x Herculase buffer 
(Stratagene) in a 50 ul reaction volume. 

The amplification parameters were: 94°C for 3min, 94"C for 30sec, 54°G for 30sec, 
and 68X for 7.1 min for a total of 30 cycles. PGR products were purified using the QIAquick 
PGR Purification Kit. 

The assembly of the entire 7.4 kb fragment containing the mutagenized maturation 
site was done using 100 ng of each PGR fragment. 0.5uM each of Primers N1 & N4, 300uM 
of dNTP, 5U Herculase (Stratagene) and 1x Herculase buffer (Stratagene) in a 100 ul 
reaction volume. The parameters for the assembly reaction were as follows: 95X for 30sec, 
55*G for 30sec, and 68^C for 7min for a total of 30 cycles. The PGR reaction products were 
analyzed on an agarose gel. 

Transformation 

Transform 500ng of the PGR fusion product along with 50ng of Bacillus replicating 
plasmid DNA ( provides chloramphenicol resistance) into lOOuL of hypercompetent Bacillus 
cells and plated on nutrient agar plates containing chloramphenicol (5y) plates. The 
resulting colonies were screened for methionine auxotrophy and PGR for deletion of metB 
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gene. This method produced >900 recombinant deletions per microgram of transfomiatlon 
mix (>6% of chloramphenicol resistant colonies). 

Various other examples and modifications of the foregoing description and examples 
will be apparent to a person skilled in the art after reading the disclosure without departing 
5 from the spirit and scope of the invention, and it is intended that ail such examples or 
modifications be included within the scope of the appended claims. All publications and 
patents referenced herein are hereby incorporated by reference in their entirety. 
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CLAIMS 

1. A method of producing a transformed microorganism, comprising: 

(i) selecting a competent microorganism; 

(ii) producing a DNA construct in vitro] and 

(ill) directly transforming said microorganism with said DNA construct 
such that the DNA construct becomes integrated into a chromosome of said 
microorganism. 

2. The method of claim 1 , wherein said microorganism is selected from the group 
consisting of Acinetobacter, Themnus, Deinococcus, Radiodurans and Bacillus. 

3. The method of claim 2, wherein said microorganism is a Bacillus. 

4. The method of claim 3, wherein said Bacillus is a super-competent strain. 

5. The method of claim 4, wherein said super-competent Bacillus is a Pxyl-comK 
strain. 

6. The method of claim 1 , wherein said DNA construct comprises homologous 
DNA selected from the group consisting of wild-type, mutagenized and modified DNA. 

7. The method of claim 1 , wherein said DNA construct comprises heterologous 
DNA selected from the group consisting of wild-type, mutagenized and modified DNA. 

8. The method of claim 1 , wherein said DNA construct comprises an incoming 
sequence sequence of interest, flanked on each side by a homology box. 

9. The method of claim 8, wherein said DNA construct further comprises stuffer 
sequences. 

10. The method of claim 1 . wherein said DNA construct is a non-plasmid DNA 
construct. 

1 1 . The method of claim 1 wherein the DNA construct is produced without the use 
of a shuttle vector or an intemriediate host. 
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12. The method of claim 1 . further comprising the steps of selecting a target 
sequence in a chromosome of said competent microorganism, and increasing the 
homology between said target sequence and said DNA construct. 

1 3. A library of mutants produced by the method of claim 1 . 

14. Directed evolution of a sequence in the host cell chromosome, comprising: 

(i) in vitro mutagenesis of a DNA construct, 

(ii) direct transformation of the mutagenized sequence into a competent 
host cell, 

(iii) screening for. or selection of, mutants possessing or exhibiting a 
desired property, and 

(iv) repeating steps OHiii) for one or more rounds, 

15. The method of claim 14 wherein the host cell is a Bacillus. 

1 6. The method of claim 1 5 wherein the Bacillus is a Pxyl-comK strain. 

1 7. The method of claim 14, canied out so as to evolve single-copy genes of a 
competent Bacillus strain. 

1 8. A method for constructing a sequence of interest at a target sequence of a 
selected microorganism, wherein said target sequence includes a residing mariner, 
said method comprising the steps of: 

(!) assembling a DNA construct in vitro comprising an incoming sequence, a 
selectable marker, and two flanking sequences which are homologous to sequences of 
said target sequence, wherein said selectable mari<er of the DNA construct is different than 
the residing marker of the microorganism; 

(ii) transforming said microorganism with the DNA construct under conditions 
permitting the incoming sequence and selectable marker to inactivate the residing marker, 
and selecting for transfonnants that include the selectable marker; 

(iii) repeating steps (i) and (ii) wherein with each repetition of said steps the 
DNA construct comprises a selectable mariner different from the selectable mariner in the 
previous step and the selectable mari<er of said previous step acts as the residing marker 
in said microorganism. 
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19. The method of claim 18, further comprising, after step (ii), the step of: 
testing the transformants for loss of the residing marker, thereby verifying that the 
construct was incorporated into the con-ect locus of the chromosome. 
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Figure 8B. Optimization of homology box size 
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Figure 1 1 


6.8kb PGR 


Zero Blunt PGR 
TOPO vector (3.5kb) 


6.8kb PGR product 


10.3kb total vector 


Xma I site 


Non-homologous flank 

Left HB+Subtilisin+Marker+Right HB 

Non-homogous flank 

{~2kb) 

(~6.8kb) 

(~1.5kb) 
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Figure 12 


Bbsl Bbsl 


Vector 


wo 02/14490 


14/16 


PCT/USOl/25166 


Figure 13 


PGR 


Denature/reanneal 
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metB 
(672bp) 

N1 N3 M4 


N2 


Figure 1 5 


